MPRA 


Munich Personal RePEc Archive 


Uninsurable Risk and Financial Market 
Puzzles 


Basu, Parantap and Semenov, Andrei and Wada, Kenji 


September 2009 


Online at https: //mpra.ub.uni-muenchen.de/23351/ 
MPRA Paper No. 23351, posted 17 Jun 2010 12:34 UTC 


Uninsurable Risk and Financial Market Puzzles 


By Parantap Basu, Andrei Semenov, and Kenji Wada* 


This draft: September, 2009 


Abstract 


Following Kocherlakota and Pistaferri (2009), we consider two forms of incomplete risk shar- 
ing in economies with consumer heterogeneity: (a) where agents are unable to insure their 
consumption against idiosyncratic skill shocks and (b) where idiosyncratic shocks to skills can 
be partially insured by striking long term insurance contract with truth revelation constraint. 
When considering the equity premium, currency premium, risk-free rate, and consumption- 
real exchange rate puzzles in an integrated framework, we find empirical evidence that al- 
though the pricing kernel associated with (a) outperforms the complete risk-sharing stochas- 
tic discount factor and the pricing kernel associated with (b), it is still unable to jointly 
resolve these asset-pricing anomalies. 
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1 Introduction 


There are four well known puzzles in the macro-finance literature. Two of these puzzles are home 
based. Mehra and Prescott (1985) and Hansen and Jagannathan (1991) among others observe 
that the covariance of aggregate per capita consumption growth with the excess return on the 
market portfolio over a risk-free asset is very low, so that the representative-agent consumption 
CAPM can explain the observed market premium only if the typical investor is extremely risk 
averse. This is known as the equity premium puzzle. In addition, Weil (1989) observes that 
given the lack of variability of aggregate consumption growth, the representative agent must 
have a negative rate of time preference for the model to be able to match the observed mean 
risk-free rate. This puzzle is known as the risk free rate puzzle. 

The other two puzzles appear on the international front. Economic theory predicts that the 
log real exchange rate growth between any two countries equals the difference in the logs of the 
foreign and domestic stochastic discount factors (Brandt et al., 2006). With a representative 
agent for each country, the log real exchange rate growth must be perfectly correlated with the 
difference in the log growth rates of marginal utilities of aggregate per capita consumption of 
respective countries. This implies that,under the standard assumption of power utility, the log 
real exchange rate and log relative consumption should be perfectly correlated. In practice how- 
ever it is observed that the correlation between relative consumption and the real exchange rate 
is close to zero or even negative. The real exchange rates are more volatile and persistent than 
the log relative consumption. This is the consumption-real exchange rate puzzle documented by 
Kollmann (1992) and Backus and Smith (1993). The final anomaly with the representative- 
agent model is that it is unable to reconcile the highly volatile excess return on currency with the 
smooth aggregate consumption growth rate unless the agent is assumed to bear an implausibly 
high level of risk aversion. This is the currency premium puzzle.! 

A number of recent papers departed from the complete market paradigm with full risk 
sharing to understand these puzzles.” In a recent influential paper, Kocherlakota and Pistaferri 
(hereafter KP) (2007) address the consumption-real exchange rate puzzle in a setting where 
markets are complete with respect to country-specific shocks (individuals can fully insure their 
consumption against cross-country shocks), but domestic markets are incomplete (individuals 
cannot completely insure themselves against idiosyncratic skill shocks). They consider two forms 


of partial insurance against idiosyncratic skill shocks. The first they call domestically incomplete 


‘Lustig and Verdelhan (2007) show that the representative-agent consumption CAPM can explain the cross- 
sectional variation in currency premia only if the representative agent’s coefficient of risk aversion is around 
100. They find that the estimate of the risk aversion coefficient does not change when the representative agent’s 
Euler equations for the currency portfolios are estimated jointly with the Euler equations for US domestic bond 
portfolios (sorted by maturity) and stock portfolios (sorted by size and book-to-market ratio). 

> Bewley (1982), Mehra and Prescott (1985), Mankiw (1986), Constantinides and Duffie (1996), and Brav et 
al. (2002), among others, argue that consumers’ heterogeneity induced by market incompleteness may be relevant 
for understanding these asset pricing anomalies. 


markets (DI). Under this formulation, individuals are unable to insure their consumption against 
idiosyncratic skill shocks. The second form of partial insurance they label as Private Information 
Pareto Optimal (PIPO). Here, the agents are able to sign insurance contracts, which allow them 
to insure themselves against idiosyncratic shocks, subject to the incentive constraint that agents 
reveal truth about their private skill shocks to the financial intermediary. For each form of 
partial insurance, KP (2007) derive a restriction relating the growth rate of the real exchange 
rate to the difference in the growth rates of the moments of the cross-sectional distributions of 
consumption in two countries. Using household-level consumption data from the US and the 
UK, they show that the PIPO model fits the data with the relative risk aversion coefficient of 
around 5, while the complete risk-sharing (CRS) model and the DI model both perform poorly. 
In another companion paper, KP (2009) demonstrate the superior performance of the PIPO 
model to explain the observed mean equity premium in the US with a value of the relative risk 
aversion coefficient between 5 and 6.° 

Although both papers of KP make a major methodological contribution to model consumer 
heterogeneity in the presence of uninsurable risk, there are two problems with their approach. 
The first problem is about the robustness of this approach to attack various puzzles. Ideally, 
one expects that the same economic fundamentals should be responsible for understanding home 
and international financial puzzles described earlier. KP focus on only one puzzle in isolation 
and the proposed pricing kernels do not work properly if two or more asset pricing anomalies are 
addressed together in an integrated framework. For example, the stochastic discount factors 
proposed by KP (2009) fail to explain the observed mean risk-free rate. Likewise, both these 
pricing kernels yield implausibly low estimates of the subjective time discount factor.* 

The second problem is about the robustness of KP’s estimation results. One of the problems 
with the pricing kernels proposed in KP (2207, 2009) is that the testing results are very sensitive 
to the sample design. Kollmann (2009) documents that the KP (2007) results are highly sensitive 
to the presence of outliers. He also finds that minor specification changes may overturn the KP 
(2007) findings. Kollmann reestimates KP models and find that the real exchange rate anomaly 
continues to persist. 

These two problems are related. If KP results are so sensitive to sample design, the issue 
arises whether the superior performance of the PIPO pricing kernel will survive when more than 
one asset pricing anomalies are addressed together in an integrated framework. In this present 


paper, we precisely address this issue. While deriving our stochastic discount factors, we address 


3 Constantinides and Duffie (1996), Brav et al. (2002), Semenov (2004), Basu and Wada (2006), and Balduzzi 
and Yao (2007) also argue that the model with heterogeneous consumers can help explain the excess return on 
the market portfolio over the risk-free rate with a plausible value of the relative risk aversion coefficient. 

‘The same problem arises in Brav et al. (2002). For example, the stochastic discount factors proposed in 
Brav et al. (2002) fail to jointly explain the observed mean equity premium and risk-free rate. At economically 
plausible estimates of the risk aversion coefficient required to explain the equity premium puzzle, both these 
pricing kernels yield implausibly low estimates of the subjective time discount factor. 


four asset pricing puzzles (the equity premium, risk-free rate, consumption-real exchange rate, 
and currency premium puzzles) simultaneously using an integrated framework. Thus we propose 
to get a unique set of estimates for the preference based fundamentals, namely risk aversion and 
time preference rate which could reconcile all four puzzles simultaneously. This requires careful 
modelling of international currency trades by motivating the transaction demand for currency 
in terms of a cash-in-advance constraint. In this respect, our approach is theoretically more 
elegant than KP (2007, 2009). 

Second, to check the robustness of the results of KP as pointed out by Kollmann (2009), 
we revise the data for the US and the UK. Our quarterly dataset ranges up to 2004 while KP 
dataset is limited only up to 1999. We compare the results of our dataset with KP. 

Finally, following the tradition of Constantinides and Duffie (1996), Sarkissian (2003), Basu 
and Wada (2006) and Semenov (2008) we use the assumption of lognormality of the cross section 
consumption process to work out closed form expressions for the pricing kernels for alternative 
market environments.’ These pricing kernels are empirically easily estimable using the Gener- 
alized Method of Moments (GMM) and are less vulnerable to the sampling and measurement 
error problems mentioned in KP. 

We perform a number of experiments to understand all four puzzles in an integrated frame- 
work. We take a first look at the asset pricing anomalies using a commonly used calibration 
approach. The key result is that a given pricing kernel is unable to explain all four puzzles. This 
necessitates the use of GMM to estimate the deep parameters using the moments conditions. 
We find that when Euler equations for the equity premium, currency premium and risk free 
rate and the Backus-Smith real exchange rate equation are estimated jointly, all three models, 
CRS, DI and PIPO pass the diagnostics and provide economically plausible estimates of deep 
parameters. In other words, we are unable to reject any of the three models. Finally, we put 
these three models to further scrutiny by testing their forecasting abilities. We find that the DI 
model slightly outperforms the other two in forecasting the real exchange rates. However, the 
forecasting ability is still far from satisfactory for all three models under scrutiny. 

The rest of the paper is organized as follows. In Section 2, we describe the DI and PIPO 
environments and derive the associated stochastic discount factors. Section 3 addresses the em- 


pirical implementation of the models derived in Section 2. In Section 4, we report the empirical 


5 The assumption of lognormality of the cross sectional consumption process is empirically well grounded. Bat- 
tistin et al. (2007, 2009) show that, within demographically homogeneous groups, the cross-sectional consumption 
distribution in both the US Consumer Expenditure Survey (CEX) and the UK British Family Expenditure Survey 
(FES) is approximately log normal. They argue that this is due to the fact that the logic of Gibrat’s law applies 
to consumption. Brzozowski et al. (2009) provide empirical evidence that the cross-sectional distribution of con- 
sumption within cohort groups in Canada may be very well approximated by a log-normal. Blundell and Lewbel 
(1999) also provide strong empirical evidence of log-normality of the cross-sectional distribution of consumption in 
a variety of data sets. Attanasio et al. (2004) assume log-normality of the cross-sectional distribution of household 
consumption when studying the evolution of inequality in consumption in the US both within cohorts and for the 
all population. 


estimation and testing results for these models, as well as investigate how these results may 
be affected in the presence of the measurement and sampling errors in individual consumption. 


Section 5 concludes. 


2 The DI and PIPO Environments 


Following KP (2009), we relax the assumption of market completeness and assume that although 
international markets are complete (individuals can fully insure their consumption against 
country-specific (aggregate) shocks), domestic markets are incomplete (individuals can only 
partially insure their consumption against individual-specific (idiosyncratic) skill shocks). To 
take into account this partial insurance against shocks, as in KP (2009), we consider two market 
structures: (a) where agents are unable to insure their consumption against idiosyncratic shocks 
and (b) where idiosyncratic shocks can be partially insured by striking long term insurance con- 
tract with truth revelation constraint (the PIPO form of partial insurance against idiosyncratic 
shocks). 

In this section, we first describe each of these two environments and then derive the associated 


stochastic discount factors. 


2.1 The DI Environment 
2.1.1 The Problem 


Our DI environment is similar to that in KP (2009) and Golosov and Tsyvinski (2006) except 
that, within our approach, we have explicit stock, bond, and currency trading. We assume 
that there are two generic countries in the world: a home country and a foreign country. We 
further assume that the economy is populated by infinitely many agents with ex ante identical 
preferences. The agents are not country specific in nature and only differ in private history of 
skill shocks. 

At any date t (t = 0, 1,2, ..., T), an agent experiences an idiosyncratic skill shock 6;, which is 
drawn from a finite set O. In addition, all agents are exposed to the same aggregate shock z that 
is drawn from an uncountable set Z. The date t private skill shock history 6‘ = (61, 02, ..., 0+) and 
public shock history zt = (21, z2, ..., 24) are the tth components of OT and ZT, respectively, with 
respective probabilities 7(0’) and w(z'). We assume that the idiosyncratic skill shock and the 
aggregate shock are drawn independently, so that by observing the aggregate shock one cannot 
infer anything about the idiosyncratic skill shock. According to the law of large numbers, at 
any date t there are exactly 7(6") agents with the private history 6°. 
a Nae 


Suppose that the home country produces two goods, tradable (y and non-tradable (y 


with the following technologies: 


yt (z", 6") = di (2", o*i, i = TR, NT, (1) 


where I} is the labour used in sector i and gi is the sector į marginal product of labour. Note 
that the labour productivities depend on the history of the public and idiosyncratic skill shocks, 
zt and 6*. Both 6° and li are private information to agents. 


The aggregate outputs of traded and non-traded goods for the home country are 


¥i(2') = So yi(2', 0°)n(6"), i= TR, NT. (2) 
ot 


Assume that there are the following assets: (a) two home stocks, which are claims to the 
nominal proceeds from traded and non-traded sectors, (b) a one-period nominal bond that 
pays a nominal interest rate of r;, and (c) the home country currency, which is traded in the 
international spot and forward markets. We further assume that only the spot and forward 
contracts on currency are traded abroad, while stocks and bonds are not internationally traded. 
The currency plays a twofold role: (a) as a means of exchange (specified as a cash-in-advance 
constraint) and (b) as a store of value (the same currency can be invested in the international 
spot and forward markets). Home and foreign goods are both non-storable.® 

Financial markets open before the goods market. At the start of the day, agents trade in 
stocks, bonds, and currency. Once the financial transactions are completed, a household takes 
the left over cash to transact in goods. Each household has two distinct entities: a shopper 
and a producer. As a producer, the household produces traded and non-traded goods, while 
as a shopper it purchases the same goods. Since in the market place there are infinitely many 
shoppers and producers and a shopper meets a producer randomly, a cash-in-advance constraint 
is necessitated. 


A home country agent faces the following optimization problem: 


{ulef®, ol) ji 


T 
Max eS" oP) o 
j=t ae 
s.t. 
mimi + 9, Q6 +b < 
i=TR,NT 
Ogata st (1+ ry-)bj (4) 
i=TR,NT j—1 j—1 
and 


DE mi. (5) 
i=TR,NT 


°In order to keep the equity premium puzzle a purely domestic financial puzzle, we assume that the stocks 
and the bond are non-traded assets. This is an extreme form of “home bias” documented by Tesar and Werner 
(1995), e.g. As a result, we rule out the possibility of earning the risk-free interest on the currency held from 
one period to another. This is a simplifying assumption to rule out complex term structure issues of returns on 
currency based on trading in foreign bonds. 


Here, c is the date j consumption of sector 7 goods, Pi is the date j nominal price of sector 
i goods, Q; is the date j sector i nominal price of new equity purchases, D; = Pj Y} is the date 7 
dividends from sector i,’ gi is the date j share of sector i, rj is the date j nominal risk-free rate 
of interest, m3 is the home money invested in the spot market at date j, mi is the home money 
committed to the forward market at date 7,° m$ is the currency used for purchase of goods at 
date j, bj is the risk-free bondholding at date j, F; stands for the date j forward exchange rate, 
S; represents the date j spot exchange rate. All prices are denominated in home money. The 
instantaneous utility function Mie: cl T) is assumed to be linearly homogenous as in Backus 
and Smith (1993) and function (IR, N T) is monotonically increasing in its arguments with 
usual regularity conditions as in KP (2009). EF; |-] is an expectations operator. Expectation is 
computed with respect to the probability measures of z'+! and 6*t!. Finally, 8 is the subjective 
time discount factor and y > 0 is the relative risk aversion coefficient. 

Non-storability of goods is a crucial assumption, which explains why foreign prices do not 
enter the flow budget constraint (4). If goods are storable, then home agents can convert foreign 
currency into foreign goods and enjoy capital gains when foreign prices rise in the following 
period. 

Since within the DI framework agents are unable to insure themselves against idiosyncratic 
skill shocks, all date t prices, interest rate, and exchange rates are functions of public history 
of shocks z’ only. The crucial assumption here is that stocks and bonds do not hedge the 


idiosyncratic skill shocks. In this respect, the markets are domestically incomplete. 


2.1.2 First-Order Conditions 


The Lagrangian for the above optimization problem is 


T . u(t eae eo T B 
L = E |X pt : j ERAD] +B Slm SD Pd 
J= k j=t i=TR,NT 
T i et hey Sims _ 4 Fjaimf 
DE oat QE 4 — Que. l SE i 
+E» à; tee jSj—1 jSj—1 J i) Sj-1 Sj-1 (6) 
j=t +(1 + rj—1)bj—1 bj ms mf ms 
The corresponding first-order conditions are: 
É : Uy Uei == mP, 4 = TR, NT, (7) 
I; : Vy = mP, i =TR,NT, (8) 


TNote that there is no labour market. Agents supply their own labour and thus the dividends are simply the 
proceeds from the sale of outputs in the goods market. 

8 Although money is committed to the forward market at date j, no money literally changes hands until date 
j +1 by the very definition of a forward contract. The home investor is free to take a long (m; > 0) or short 
(mj < 0) position at the contracted forward rate F; that is binding at date j + 1. 


Ei MQ + Et [Agi (Qiy1 + Di, 1)] =0, i= TR, NT, (9) 


be: At + Ee (L + re) Ati], (10) 
S 
me: —M + Et [aes | = 0, (11) 
F, 
mf : —At + Er he =0, (12) 
t 
my > — At + Ht = 0. (13) 


Here, the subscripts of u and v state for the partial derivatives of u and v with respect to 
the relevant argument. The first-order condition (13) means that the agents allocate money 
for transaction purpose so as to equate the marginal benefit of transaction to the marginal 
opportunity cost of the foregone earnings from currency trading. 

Based on the first-order conditions (7) and (8), it is straightforward to verify the following 


static efficiency condition for the labour supply decision: 


UTR ok UTR 


elas 14 
unr gr? VNT = 


which shows the equivalence between the ratio of marginal disutilities of labour and the corre- 


sponding marginal utilities from consumption in each sector. 


2.1.3 Monetary Policy and Initial Distributions of Assets 


Monetary policy within this framework represents an initial cross-country distribution of money 


stocks, namely home money, Mo, and foreign money, Mg, to fix the date 0 spot rate such that 
Mo = So Mě.’ (15) 


In other words, central banks in both home and foreign countries coordinate monetary poli- 
cies in such a way that the initial spot rate So is pinned down. After this, the central banks let 
the nominal exchange rate float according to currency trading among countries. 


The initial distributions of stocks and bonds are such that 


g? 
X bo(0, 2°)r(0°) = 0, (17) 
9° 
and 
S "m9 (6°, 2°) (6°) = Mo (18) 
9° 


°Hereafter, the asterisk denotes the foreign country. 


2.1.4 Composite Good and Price 


Following KP (2009), it is convenient to reduce the two good setting to a composite good 
problem. Exploiting the linear homogeneity of the instantaneous utility function and the duality 


property, we can write: 


Pes So. OP, (19) 
i=TR,NT 


where P, is the minimum expenditure required to attain one unit of utility. 
That is, 


pe x . ii 
P, = arn D Prey (20) 
t> i=TR,NT 
s.t. 
ule, ye) = 1, (21) 


which means that instantaneous utility u(c/*, cN) is nothing, but the real consumption expen- 


diture or a composite consumption good that we label cq; hereafter. 
TR „NT 
Ga) 


Based on this composite consumption and exploiting the linear homogeneity of u( F 


equations (7) and (13) can be combined to obtain 
Ct: Bae! = MP =(); (22) 


2.1.5 Equilibrium 


In equilibrium, the following market-clearing conditions must hold. Given the assumption that 
stocks and bonds are not internationally traded, the stock and bond markets must domestically 


clear meaning 


Mie, z')n(0") = 1 for each i (23) 
ot 


X :(0", 2!) (8°) = 0. (24) 
0t 


The other market-clearing conditions are the traded and non-traded market clearing condi- 


tions 


D (Er, 0) + GTE, nO) = A uE, 0) + iT, O°) ao) (25) 
ot 


gt 
doh (zt, 6°) (8°) = So ut? z, 6°) (8), (26) 
ot ot 
and the currency market clearing conditions for spot and forward are 


X (me (0, 2')r(8") = SeX mý (0*,z*)n(0*).for all 2! (27) 


` 
È 
— 
S 
+ 
N 
= 
— 
I 
4 
es 
~ 
QL 
= 
N 
= 
7 
I 
j=) 
T 
laz} 
o 
E 
N 
= 


(28) 


The spot rate must be such that the supply of the home currency supplied on the spot exactly 
equals the corresponding demand. This explains the spot market clearing condition (27). The 
forward rate at each date is contracted in such a way that agents taking long and short positions 
in the forward market balance each other for all aggregate history zt. This explains the forward 


market clearing condition (28). 


2.1.6 The DI Pricing Kernel 


Assume that the world equilibrium, as laid out in the preceding section, exists. Define the real 
returns on the traded and non-traded stocks as 
atda. 
Ria SHH! STENT, (29) 


di 


where gi(z') = Qi(z')/P; is the real ith equity price and di (zt) = Di,,(2'*?)/Pi41 is the 
real dividend from share 2. 


Based on these two returns, we can define the market portfolio return as 


Ruei= >> Ruger (30) 
i=TR,NT 


Define the real risk-free rate as 


(1+ re41) Pt 


i (31) 
Pry 


RFt+1 = 


Based on the first-order conditions (9) and (10), we can thus write the first-order condition 


for the real market portfolio return as 


Art Pipa | 
Ey | —— — R =1 32 
(| rn (32) 


and the first-order condition for the real risk-free rate as 


Atti Prat | 
FE, | — — R =I. 33 
| resp (33) 
Likewise, the first-order conditions (11) and (12) give the spot and forward rate equations 
as follows: ee: 
fg poe ee 34 
: | Xe S (34) 
and TE 
PA (ics aaa E] 35 
ea (35) 
or, equivalently, 
Ae Pr Sti P 
z| ttl d tl Dt+l | =4 (36) 
At Pe aP 


and 2: Z 
Ati Pi FP 


A Pi St Pei 
From (32), (33), (36), and (37), it follows that within the DI framework the stochastic 


discount factor is 


E; Si (37) 


SDFP! = re Ey D (38) 

In the next step, we follow the same principle as in KP (2009) to rewrite the stochastic 

discount factor (38) in terms of the cross-sectional moments of consumption. Define a generic 

asset n with the real gross return Rp, which is a function of the aggregate shock history z* only. 
In the present setting, this asset can be an equity, risk-free bond, currency forward, or spot. 


Using (22), the Euler equation for such a generic asset can be written as 


c(0t, zt) = BY Rne awe 2) Y ery (0ttt, zt Taoto). (39) 
zt+t1 gttl 
Define 
Elh lz, 0°) = XO cea (01, z+) 10110") (40) 
gttl 


as the — 7th non-central cross-sectional moment of composite good consumption conditional on 
private history 6’ and public history z‘+!. 


Thus, equation (39) can be rewritten as 


c:(0",z')-7 = B > Rnapi(z ba 2) Ela gle, 0°). (41) 


gttl 


Integrating the both sides of (41) over 6’ and using the law of iterated expectations, we get 


E(c,7|z") =B `> Raapil lz t zE lalz) (42) 


gttl 


or, equivalently, 


E(cp lz?) 
fe =1, 43 
p E(c "|z") n,t+1 ( ) 
which immediately shows that 
Ae Pepa E(o,44|2"*") 
SDFD! = a a 44 
t+1 X P p Ele’ |zt) (44) 


is the stochastic discount factor associated with the DI environment. 

This stochastic discount factor differs form the pricing kernels proposed by Constantinides 
and Duffie (1996), Sarkissian (2003), and Semenov (2008) in that it uses the cross-sectional 
distribution of consumption in level, rather than the cross-sectional distribution of consumption 


in growth rates, as the driving process for the pricing kernel. As we mentioned above, the use of 


10 


the cross-sectional distribution of consumption in growth rates has a counterfactual implication 
that the cross-sectional distribution of consumption in level is not stationary, implying that the 
Gini coefficient of consumption distribution goes to infinity. 

We are now ready to write the relevant excess return equations, which are namely the equity 
premium and currency premium equations. Subtracting (33) from (32), we get the following 


real equity premium equation: 
Fr; [SDFË} (Ruts — Rrtsi)] =0. (45) 


Subtracting (36) from (37), we obtain the real currency premium equation: 


E; [spre (4 ? e) 5 +| =0. (46) 


The real risk-free rate equation (33) is 
E; (SDFEA Reel =1. (47) 


Because the log real exchange rate growth between any two countries is equal to the difference 
in the logs of the foreign and domestic stochastic discount factors (see Brandt et al. (2006)), 


within the DI framework 


Aq = In (+) = In (SDFP4") — In (SDFP). (48) 


In equations (45)-(48), SDFẸ} is as defined in (44). 


2.2 The PIPO Environment 
2.2.1 The Problem 


In this alternative setting, agents are able to partially insure against idiosyncratic skill shocks. 
The model is a dynamic extension of Mirrlees (1971) type private information setting. Trading 
convention is similar to that in Golosov and Tsyvinski (2006) and KP (2009). All agents are 
assumed to have ex ante identical preferences. There is a continuum of insurance firms, which act 
on behalf of the households and play the following roles: (a) produce the traded and non-traded 
goods by hiring workers, (b) sell these goods in national and international markets, (c) trade 
among themselves in stock, bond, and currency in sequential markets, and, finally, (d) with the 
resulting profits from this trade insure the households against idiosyncratic skill shocks. Timing 
of financial and goods markets is the same as in the DI setting. The same cash-in-advance 
constraint applies to the insurance companies when they trade in goods. 

The insurance firms are owned equally by all agents. At date 0, before the realization 
of aggregate and idiosyncratic shocks, the contract market opens only once. In this market, 


the competitive insurance firms offer contracts to the households about consumption bundles 


11 


of traded and non-traded goods {c7£,cNT}, which provide maximum ex ante utility to the 
households. Since the insurance company does not observe the idiosyncratic shock history and 
labour supply, it stipulates contract about the observed output sequence of traded and non- 
traded goods {yp Royn T), such that it is incentive compatible for the agents to reveal the truth 
about the history of private skill shocks. These contracts are long-term contracts with full 
commitment on both sides. After the contract market closes, from date 1 onward the insurance 
firms start trading in goods and financial markets in the same sequential manner as within the 
DI framework. 

A typical insurance company, located in the home country, maximizes the present value of 


the nominal payoffs to its owners: 


T. 4 
Ho ICETA C (49) 


{ef el ve YER yNT E bime, mf} t=0 i=1 


s.t. 


e(z) + mj (z) + mz (2) + mez) + JO QIE) + blz) < 


i=TR,NT 
sae) EAD E gae + meyer) 
i=TR,NT gt i=TR,NT ae) 
sz Ca) Oe ae t-1 t-1 
Sı (zt 1) 1(Z J+ 4 ri(z ))b: 1( ), (50) 


the cash-in-advance constraint 
SO X (OPE (a ilO, z) < melz’), (51) 
ot i=TR,NT 

the participation constraint 


t 0t, zt), cl 1—y TR Bt gt) yNT(6t, zt ht 
ye > | a eha Or al o (Sm 7) ae ELY | muta!) > u 
0t zt 


and the incentive constraint 


: ROL a) NTO z (EPO) NTON] gyi 
z zi Teg e aa | OMe > 


Rl zt), oT (0, z) yE (O',2') yi" (0, zt) 

3 ay [a e n(Y), (53) 
0t zt t ý A 

where II;(z‘) is the date t cash flow of the insurance firm contingent on the shock history zt, 

p,(z") is the zt contingent discount rate, and 0! is the history of shocks that the household reports 

to the financial intermediaries. Since the insurance firm does not observe the idiosyncratic shock 


history, all its relevant choices depend on the aggregate shock history zt. 
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2.2.2 First-Order Conditions 


Let Az (2), (zt), w:(z*), and 7,(z’) be the Lagrange multipliers associated with (50), (51), (52), 
and (53), respectively. The first-order conditions for problem (49) through (53) are as follows: 


+): [ [G+ az) — Az) = 0, (54) 

i=] 
ci : B'(we(2") +m (z) u; uc = plz*)P}, i =TR,NT, (55) 
yi : 6 (wilz*) au: m(z")) uy = A(O, z')éi_ (2) bi (2", 0°) Pi, t= TR, NT, (56) 

Ei : -Qi (z) A(z yz") + D (Qua (z?) + Dyyr(z**")) tai (2*)y(z'*") = 0, i= TR, NT, 
f (57) 
bi: A(Z) + D Aal + r(t) = 0, (58) 
-a +E a E yah), (59) 
AY) + Yo ala) AED (att), (60) 
and 

mẹ : —d2(2*) + u (z*) = 0. (61) 


A few clarifications are in order. Based on (54), the Lagrange multiplier A; represents the 
date 0 state claims price of a dollar to be delivered at date t contingent on z’. Because of (61), 
this state claims price is the same as the marginal transaction benefit of a dollar, ju,(z’). It can 
be seen that the other first-order conditions are similar to those in the DI setting. Note that 


the use of (55) and (56) yields the same static efficiency condition as (14). 


2.2.3 First-Order Conditions 


The monetary policy and the initial distributions of assets are the same as those described by 
equations (15) through (18) in Section 2.1.2. 


2.2.4 Composite Good and Price 


As within the DI framework, we can reduce the two good setting to a composite good problem 
described by equations (20) and (21). This means that the two goods can be reduced to a 


composite good c; with an associate composite price P; so that the following equality holds: 


Pig= X Pihal, z). (62) 


i=TR,NT 
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2.2.5 Equilibrium 


Following Kocherlakota (2005), we can show that the equilibrium allocation {c7 £, eN7, yP?, yNT} 
for this decentralized economy solves a constrained social planning problem, where the con- 
straints involve the truth revelation incentive constraint. Because of this optimality, KP (2009) 
call this allocation Private Information Pareto Optimum. In equilibrium, the market-clearing 
conditions (23) through (27) hold. 


2.2.6 The PIPO Pricing Kernel 


From (54), we obtain the following useful relationship between the Lagrange multipliers and the 
stochastic discount factor: A i 
t+1 
= . 63 
At 1+ pyy1(z'**) oe 
Using (61) and (63), and defining the conditional probability ~(z‘*!|z‘) = y(ztt!)/y(z*), 


we get 


Qiz) + Di, (zt) w(at* |) 


=1,¢=TRNT. 64 
i Oia T+ pia) G 
Likewise, using (58), (59), (60), and (63), we obtain the following equations: 
(zit |z*) 
14+ reai(zé = 1 65 
2a t+1( )) ee P41 (ze!) ( ) 
t+1 t+1)pt 
gttl S:(z ) 1+ Piyı(Z + ) 
and F(z t+1|,¢ 


geri Se(z") 1+ pegal zT!) 

To characterize the discount rates p,(z'), we follow Kocherlakota (2005) and Golosov et 

al. (2006). Fix the date t history 6t and zt. Decrease the composite good at date t for this 

history group by an infinitesimally small amount GA; and increase across the board the date 

t+ 1 composite good by A;. This compensating variation leaves the objective function and the 

incentive and participation constraints unaffected. It only impacts the resource constraints. The 

insurance company now makes sure to minimize the cost of resources at dates t and t+ 1 for all 
possible evolutions of the private and public shocks. 


To solve this problem, define 


T (0t, zt)! ci(0t, z')1-7 
= A 
~ a - Ba, (68) 


and t+1 -t+1)1 t+1 -t+1)1 
DAE ts, OTE gr Ve? 


1-y¥ - 1-y¥ 


Coral + Ab. (69) 
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The insurance company thus chooses A; such that the cost of resources at dates t and t+ 1 
evaluated at the respective state claims prices \;(z’) and Az41(z't!) is minimized at A; = 0. 
Using the flow resource constraint (50) and (62), this cost minimization problem can be rewritten 


as 
Min d4(2'YPe(2*) (c4(8", 2")? — B(1 -= 7) Ax) aO) 


+ Yes (241) Py (zt) T C a H= 7)A1) 1/(1-7) 7(6t*), (70) 
gttt 

The first-order condition with respect to A; evaluated at A; = 0 and the use of (54) and 

(61) yield the following inverse Euler equation: 
BPi(2*)c; (0°, 28)1 (0°) = (1 + pepi (zt) t Pipa (zt?) 5 cp (Ot, zit!) (9), (71) 

gttl 

Next, first integrating the right-hand side of (71) with respect to tt! for a given 6° and 
then integrating the left-hand side of (71) with respect to 6’, and applying the law of iterated 


expectations, we get 


BP (2!) B( el | 2*) = (1+ pega (2) Pr (2B (cha 2) (72) 
or, equivalently, E 
1 = E(¢ | zt) P,(z*) 
1+ pry (zt) EG ea Zit!) Pisa (gett). 


Plugging (73) into (64) through (67), we obtain that within the PIPO framework the sto- 


(73) 


chastic discount factor is 


SDFPIPO _ 1 Pial) =g E(c, | zt) (74) 
one 1+ pegs (zt!) Palzt) E(cp,,| z+) 


With this pricing kernel, the Euler equations for the equity premium, the risk-free rate, and 


the currency premium can be rewritten as 


E; (SDFLAP° (Ruts — Rrt+i)| = 0, (75) 
E; [SDF Rri] = 1, (76) 
i Roga P 
Et [spRnyre ( — =) — | = 0, (77) 
St Pry 


respectively, with SDE O defined as in (74). 


The log real exchange rate growth between two countries equals the difference in the logs of 


the foreign and domestic pricing kernels and hence within the PIPO framework 


Agua = In (SDFAPC) — in (SDF O). (78) 
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3 Empirical Formulation 
3.1 Consumption Process 


In a similar spirit as in De Santis (2007), we assume that the log of individual consumption is 
the sum of the log of aggregate per capita consumption and the uninsurable consumption due 


to the idiosyncratic uninsurable skill shock: 


Chk,t = Ck,t + In (Önk), (79) 


where Chk = In (Chk) is the log of the date t consumption of the Ath investor in country k, 
Ckt = In (Ck t) is the log of the aggregate per capita consumption in country k at time t, and 
Onk,t is the idiosyncratic consumption shock. 


We assume the following process for 6p 4,1: 


VATh (Chk,t) 
se) (80) 


Onk,t = CZD (nn varn (Chk,t) — 


where varp (Chp t) is the cross-sectional variance of log consumption and up, t is a standard 
normal shock, which is independently and identically distributed across countries, individuals, 
and time. 

The sth non-centred moment of the cross-sectional distribution of consumption is given by 


2 


s-s 
En (Chpt) = Cerexp ( var p, (ens) (81) 


Note that, by construction, the aggregate consumption is the sum of individual consumption, 
what can be checked by setting s = 1. Therefore, this log normal process satisfies the feasibility 
condition. To check whether this log normal process satisfies the optimality conditions, we follow 
a reverse engineering approach. If we can find a pricing kernel that supports this allocation of 
country k consumption and is also independent of the agent’s private history, then it must be 
satisfying the individual optimality conditions. 

A problem with using the KP (2007, 2009) DI and PIPO stochastic discount factors is that 
extreme observations in the right tail of the cross-section distribution of individual consump- 
tion may dramatically affect the high-order non-centred cross-sectional consumption moments 
(especially when the order of the moment, y, is high (about 5 or 6), as in KP(2007, 2009)) and 
hence the model estimation results.!? In section 4.5, we investigate whether the pricing kernels 
constructed using the assumption of the cross-sectional log normality of individual consumption 


are sensitive to measurement and sampling errors in the consumption data. 


See, for example, Kollmann (2009), who shows that the use of the winsorized cross-sectional moments (that 
are less sensitive to measurement error in individual consumption) overturns the KP (2007) regression results. 
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3.2 Pricing Kernels 


Plugging (81) into (44) and evaluating at s = —y, we obtain the following pricing kernel for the 


DI environment from the home country’s perspective: 


Cai.” + 
SDREL =B ( ta) exp (2 n Avarp (enea) ) . (82) 


where C; is the home country aggregate per capita consumption at time t and Avar'p (Ch t41) = 
VATh (Cht+1) — Varn (Ch,t). 
Likewise, plugging (81) into (74) and evaluating at s = y, we get the stochastic discount 
factor associated with the PIPO environment from the home country’s perspective: 
Cay -a 
SDF? =£ ( a) exp (2 = Avarp (nen) ; (83) 
t 
These specifications of the DI and PIPO pricing kernels are different from those in KP (2007, 


2009) and are new to the literature. As we show in Section 4.5, in contrast with the specifications 
in KP (2007, 2009), the above stochastic discount factors are robust not only to the observation 
error in individual consumption, but also to sampling error, what makes the model estimation 
and testing results more reliable. 

Under the assumption of complete risk sharing, within each country the agents are able to 
equalize state-by-state their IMRS and hence their optimal consumption growth rates. This 
implies that for both the home and foreign countries the cross-sectional variance of household 
consumption is constant over time and hence both the DI and PIPO stochastic discount fac- 
tors for each country reduce to the pricing kernel in the representative-agent framework (the 
discounted aggregate per capita consumption growth rate in the respective country raised to 
the power of the negative utility curvature parameter).!! The stochastic discount factor for this 
environment from the home country’s perspective, for example, is: 


SDFCRS — Cut) 4 
t+1 =< B Ce : (8 ) 


4 Empirical Investigation 


In this section, we assess empirically the potential of the DI and PIPO pricing kernels derived 
in the previous section to account for the observed fluctuations of the real exchange rate, equity 
premium, risk-free rate of return, and currency premium and compare the performance of these 
models with the CRS model. Two countries under investigation are the US (treated as the home 


country) and the UK (treated as the foreign country). 


'l When the consumption growth rate is the same for each agent, we can write Ch, t+1 = aCn,z for each h, where 
a is a constant. This implies that varn (Cn,t+1) = varn (In (a) + cnt) = varn (Cn, t) and hence Avary (cn,t41) = 0. 
Clearly, the same is true for the foreign country, i.e., Avaran (Geeew) =0. 
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4.1 Data 


Consumption. As is conventional in the literature, the consumption measure used in this 
paper is consumption of nondurables and services. For the US, data on household quarterly con- 
sumption of nondurables and services are from the US Consumer Expenditure Survey (CEX), 
produced by the Bureau of Labor Statistics (BLS). For each household, we calculate quarterly 
consumption expenditures for all the disaggregated consumption categories offered by the CEX. 
Then, we deflate obtained values in 2005:Q1 US dollars by the Consumer Price Indexes (CPI’s) 
(not seasonally adjusted, urban consumers) for appropriate consumption categories. !? Agere- 
gating the household’s quarterly consumption across these categories is made according to the 
National Income and Product Account (NIPA) definition of consumption of nondurables and 
services. 

Following Brav et al. (2002), in each quarter we drop households that do not report or 
report a zero value of consumption of food, consumption of nondurables and services, or total 
consumption. We also delete from the sample the nonurban households, the households residing 
in student housing, the households with incomplete income responses, the households that do 
not have a fifth interview, and the households whose head is under 19 or over 75 years of age. 

To calculate the household’s quarterly per capita consumption, we divide the quarterly con- 
sumption expenditure of each household by the number of people in the household in that 
quarter.!3 The within-country consumption variance for each quarter is then calculated as the 
cross-sectional variance of the log household’s quarterly real, per capita consumption. Because 
of the poor quality of the CEX data before 1982, the sample period is from 1982:Q1 to 2004:Q4. 

The US data on quarterly seasonally adjusted US dollar nominal aggregate consumption of 
nondurables and services are from the US Bureau of Economic Analysis (BEA). The real aggre- 
gate consumption of nondurables and services is calculated by dividing the nominal seasonally 
adjusted aggregate consumption of nondurables and services by the CPI (2005:Q1=1) for non- 
durables and services (from the BEA). The US aggregate per capita consumption is calculated 
by dividing the real aggregate consumption of nondurables and services by the US population 
(from the BEA). 

For the UK, we use the UK Family Expenditure Survey (FES), a voluntary survey of a 
random sample of private households in the UK, conducted by the Office for National Statistics 
(ONS).!4 The data of approximately 6,500 households are collected throughout the year to cover 


seasonal variations in expenditures, with either the week or month, in which the fieldwork is 


12 The CPI series are obtained from the BLS. 

13T Section 3.1, we made the assumption that the individual consumption expenditures are log normally 
distributed in the cross-section. We use the Jarque-Bera statistic to check this assumption for the US. The 
evidence is that in the CEX the assumption of the cross-sectional log normality of the household’s quarterly per 
capita consumption is not rejected statistically at the 1% significance level for a large majority of quarters. 

'4Tn April 2001, the FES was replaced by the Expenditure and Food Survey (EFS), which also covered the 
National Food Survey (NFS). 
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carried out, being randomly assigned to each individual household. Of the data available in the 
FES, we use the diary records of daily expenditure, kept for two weeks by each individual aged 
16 or over in the household survey. 

Using these diary data, the cross-sectional variance of the log household’s quarterly real, 
per capita consumption of nondurables and services for each quarter is computed as follows.!° 
First, we calculate the household-wide consumption of nondurables and services by adding the 
consumption only of nondurables and services (measured in UK pounds) for each individual in 
the household. The definition of nondurables and services follows that of Attanasio and Weber 
(1995). Second, given that the household consumption data are for the two week durations only, 
we multiply them by 6.5, so that the data are converted into quarterly frequency. Third, we 
divide the quarterly consumption expenditure of each household by the number of people in the 
household in that quarter to derive quarterly nominal, per capita consumption of nondurables 
and services. Fourth, we categorize the household consumption data into four quarterly groups, 
based on the quarter or month the survey was conducted for the household. By dividing the 
data by the quarterly CPI for all items (not seasonally adjusted)!® with the basis of 2005:Q1, 
the quarterly real, per capita consumptions are calculated. Finally, we take the logarithms 
of the quarterly real, per capita consumptions calculated in the previous step, followed by 
the calculation of the cross-sectional variance of the log household’s quarterly real, per capita 
consumption for each quarter. 

The UK data on seasonally adjusted nominal aggregate consumption of nondurables and 
services are from the ONS and the UK Data Archive (UKDA). The UK real aggregate con- 
sumption of nondurables and services is calculated by dividing the nominal seasonally adjusted 
aggregate consumption of nondurables and services by the CPI (2005:Q1=1) for nondurables 
and services (from OECD main economic indicators). We calculate the UK aggregate per capita 
consumption of nondurables and services as the real aggregate consumption of nondurables and 
services divided by the UK population (from OECD main economic indicators). 

When testing alternative pricing kernels, we use two different sets of household-level data 
on consumption expenditures from the US and the UK. The first set (we call it the BSW data 
set) is described above (as we mentioned above, due to the poor quality of the CEX data before 
1982, this data set covers the period from 1982:Q1 to 2004:Q4). The second one is the data set 
used in KP (2007, 2009) (the KP data set, hereafter), which covers the period from 1982:Q1 to 
1999:Q4.'" We refer the reader to KP (2007, 2009) for the description of their data on household- 


level consumption expenditures. Because the consumption definition and the sample selection 


15 Our procedure mimics KP (2009). 

16 The CPI is from the OECD main economic indicators. 

'" When using the data on household-level consumption expenditures from KP (2007, 2009), we do not consider 
the data for 1980 and 1981 due to their poor quality in the CEX. As a result, the data set we use in estimation 
is two years shorter than the data set used by KP (2007, 2009). 
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procedure in KP (2007, 2009) are different from ours,'® we believe that estimating the same 
models using these different data sets on consumption expenditures would provide a good check 
of whether the estimation and testing results are robust to the used measure of consumption 


and sample design. 


The Spot and Forward Exchange Rates. The nominal spot, S;, and 3-month forward, Fy, 
US Dollar to British Pound currency exchange rates are from DATASTREAM (series XUDLUSS 
and XUDLDS3, respectively). The real spot exchange rate, Q+, is calculated as 


_ SCPH} 


Qt = “CPI,” (85) 


where CPI; and CPI; are respectively the US and UK CPI’s (2005:Q1=1) for consumption of 


nondurables and services. 


Asset Returns. We use three different proxies for the market portfolio return. The first two 
are the value-weighted, Ryw, and equal-weighted, Rgw,+, returns (capital gain plus dividends) 
on all stocks listed on the NYSE, AMEX, and NASDAQ. The data on the nominal quarterly 
value- and equal-weighted returns on all stocks listed on the NYSE, AMEX, and NASDAQ for 
the period from 1982:Q1 to 2004:Q4 are obtained from the Center for Research in Security Prices 
(CRSP) of the University of Chicago. We also view the market portfolio as consisting of five 
industry portfolio return indices. The nominal quarterly value-weighted returns on five NYSE, 
AMEX, and NASDAQ industry portfolios ((a) consumer durables, nondurables, wholesale, retail, 
and some services (laundries, repair shops), R14, (b) manufacturing, energy, and utilities, Roz, 
(c) business equipment, telephone and television transmission, Rg, (d) healthcare, medical 
equipment, and drugs, Raz, and (e) other, Rs +) are from Kenneth R. French’s web page. 

The risk-free rate is the 3-month US Treasury Bill secondary market rate on a per annum 
basis obtained from the Federal Reserve Bank of St. Louis. In order to convert from the annual 
rate to the quarterly rate, we raise the 3-month Treasury Bill return on a per annum basis to 
the power of 1/4. 

The real quarterly returns are calculated as the nominal quarterly returns divided by the 3- 
month inflation rate based on the deflator defined for consumption of nondurables and services. 
We calculate the equity premium as the difference between the real equity return and the real 
risk-free rate. 


Table I reports the descriptive statistics for the data sets used in estimation. 


| Table I | 


18 As the measure of consumption, KP (2007, 2009) use non-durable consumption and not consumption of 
nondurables and services, as in our case. In this paper, we apply the household selection criteria used in Brav et 
al. (2002), which are different from the selection criteria in KP (2007, 2009). 
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4.2 A First Look at the Financial Market Puzzles 


Perhaps the simplest way to assess the plausibility of a candidate stochastic discount factor 
is to assume joint conditional lognormality and homoskedasticity of the variables in the Euler 
equations and then to analyze the restrictions on asset returns implied by the pricing kernel. 
In the CRS specification, joint conditional lognormality and homoskedasticity imply that the 
expected equity premium is linear in the variance of the market portfolio return and covariance 


of the return on the market portfolio with consumption growth: 


1 
Ex [rmt+ — Tft] = — var (rueti) + yeou (T M,t+1;, Act+1) , (86) 


where rjyyt41 is the log return on the market portfolio, rft+1 is the logarithm of the risk-free 
rate of return, and Ac: is the logarithm of the aggregate per capita consumption growth rate 
between t and t+ 1. 

The risk-free rate obeys 


2 
reg = in (b) — Tvar (Act) + qE [Act]. (87) 


P P s š . 
Denote In ( P, ) as $1441 and In ($a ) as $9411. With these notations, we can write 
StPt+1 ; StPt+1 


the equation for the currency premium as 


Es [s1441 — s2441] = —5 (var (s1t41) — var (s2,t41)) +7 (cov (s1,t+1, ACt+1) — cov (52,441, ACt+1)) 
(88) 
Under the assumption of complete risk sharing, the equation for the log of the growth rate 


in the real exchange rate is 
Aq = 7 (Act — Ac) i (89) 


Substituting the moments in Table II into equation (86) shows that a relative risk aversion 
coefficient of 593.68 is required for the CRS model to explain the observed mean excess log 
return on the market portfolio. As follows from equation (87), this high value of the risk 
aversion coefficient y implies a subjective time discount factor of 2.34. These results illustrate 


the equity premium and risk-free rate puzzles. 
| Table II | 


One can appreciate the currency premium puzzle by examining equation (88). The moments 
reported in Table II imply that the expected currency premium predicted by the CRS model is 
much higher than the observed value when y is 593.68. To fit the observed mean log currency 
premium, the representative agent must be highly risk seeking. This is the currency premium 


puzzle. 
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From equation (89), it follows that the log real exchange rate growth must be perfectly 
correlated with the difference in the log growth rates of aggregate per capita consumption 
of respective countries. However, as it can be seen from Table II, the correlation between 
relative consumption and the real exchange rate is negative. This illustrates the consumption- 
real exchange rate puzzle. 

For the PIPO stochastic discount factor, with joint conditional lognormality and homoskedas- 


ticity of the variables in the Euler equations, we obtain for the log risk premium: 


ee) 
T cov (rm1, Avarn (cht+1)) 
(90) 


1 
Ei [rM t+ — Tft] = ~ guar (rut+i)+ycov (rut+1, ACt+1)— 
and for the log risk-free rate: 


E; [Avarp, (cat+1)] 


2 
rut = —in(b)-— Tvar (Aci) + 7E [Act] — 


(=a) 
2 


_(-7)’ 


3 cov (Acr41, Avarp (Ch t+1))- (91) 


var (Avarp (cnt41)) + 


At economically realistic (below 10) values of the risk aversion parameter, the CRS model 
yields the mean log excess return, which is too low compared with the value observed in the 
data. Because, as follows from Table II, cov (rM +1, Avarn (Cht+1)) > 0, we may expect the 
PIPO model with y > 1 to generate (at the same values of y) a higher log equity premium and 
hence to fit the observed log excess return on the market portfolio over the risk-free rate with a 
lower, compared with the CRS model, value of the relative risk aversion coefficient. Using the 
data from Table II, we obtain that the PIPO model can replicate the mean log equity premium 
with y equal to 29.89, what supports the intuition above. 

However, at this value of y, the mean log risk-free rate generated by the PIPO model is 
much lower than the observed value, implying a subjective time discount factor, required to fit 
the observed log risk-free rate, close to zero.!® This is due to the fact that, as reported in Table 
II, E: [Avar (ch t+1)] < 0 and cov (Act+1, Avarp (Ch t+1)) > 0, implying, according to equation 
(91), that uninsurable risk drives down the log risk-free rate yielded by the model when y > 1. 
Therefore, although the PIPO model can lower the value of risk aversion required to explain 
the mean log equity premium, this model tends to produce an implausibly low mean log real 
interest rate. 


The PIPO model implies the following restriction on the currency premium 


Fr |sit+i — sat] = -3 l (var (siti) — var (52,241)) 
+y (cov hes Act+1) — cov ($2,t41, Act+1)) (92) 
me a (cov (81,441, Avarp (Ch t+1)) — cov (s241, Avarp (Cht+1))). 


19KP (2009) also emphasize this problem with the subjective time discount factor in the PIPO framework. 
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For the data set under consideration, the difference between cov (51,441, Avarp (Ch,t+1)) and 
cov (S241, Avarp (cnt41)) is negative and hence, with y > 1, the PIPO model yields the mean 
log currency premium that is lower, at the same value of y, than the mean log currency premium 
in the case of complete risk sharing. The mean log currency premium increases as the risk 
aversion coefficient decreases. Despite this, with y equal 29.89 the mean log currency premium 
generated by the PIPO model remains too low compared with the observed value and a lower 
value of y (a y of 14.31) is needed for the model to fit the observed currency premium. 


In the PIPO economy, the log of the growth rate in the real exchange rate is 


ae 
Aqiri = 7 (Acti — Ay) - a (Avarn (cht+1) — Avarn (h1) )- (93) 


Within the PIPO setting, agents buy contracts from insurance firms to insure against in- 
dividual shocks subject to incentive constraints. Hence, there are two opposite effects: the 
precautionary saving effect and the incentive effect. When the coefficient of relative risk aver- 
sion is lower than 1, the precautionary saving effect dominates and therefore the home country 
currency appreciates with the increase in the home country uninsurable risk (i.e., the increase 
in the value of Avarp (cnz+1)). The higher the agent’s aversion to risk, the greater the incentive 
effect, so that, when 7 exceeds 1, the incentive effect dominates the precautionary effect, what 
results in the depreciation of the home country currency. 

As we can see from Table II, the log of the growth rate in the real exchange rate, Aq+1, is 
negatively correlated with the term (Avarp (cnt41) — Avaran (c) .,1)) and hence the new (com- 
pared with the CRS model) term in the right-hand side part of equation (93) can help explain 
volatility of the log of the growth rate in the real exchange rate only if y < 1, while the value of 
the risk aversion coefficient greater than 1 is required to explain the equity premium. 

To check whether the results are different for the DI model, observe that in the DI environ- 
ment the restrictions on the expected equity premium, risk-free rate, currency premium, and 
the log of the growth rate in the real exchange rate are as follows. 


The expected log excess return on the market portfolio over the risk-free rate is 


2 


Ft [rM t41 — Tft] = — Svar (rut+i)t+ycou (rM t+1, Acts) — A cov (TM +1, Avarn (Ch t+1)).- 
(94) 
The equation for the log risk-free rate is 
2 2 
rj = ~in (p) — Evar (Aci) +E [Acn] — EE [Avar (ene)] 


2 
+ 2 
— Lec tie (Avarn (cht41)) + 


yaota? 
3 147) ou (Act+1, Avarp (Ch t+1))- (95) 
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For the expected log currency premium, we have 


1 
Es [s1441 — s2441] = —5 (var (sit41) — var (s2,241)) 

+7 (cov (81,241, Act+1) — cov (82,441, Act+1)) (96) 
JEY 

=z (cov (s1,t+1, vary (cht41)) — cov (52,t+1, AVarn (Ch,t+1))) 

and, finally, the real exchange rate equation is 
; JEY r 
Aq = 7 (Aci = Aci 1) in a (Avarp (Chyt+1) — Avarp, (Ch t41)) . (97) 


In the DI framework, higher home country uninsurable risk (that results in a higher value of 
Avarp, (Cnt41)) raises the precautionary demand for both traded and non-traded goods, which 
makes the home country currency appreciate.” As in the case of the PIPO model, within the 
DI setting in the absence of uninsurable risk the equation for the real exchange rate reduces to 
the Backus-Smith (1993) specification. 

It is observed that at y equal to 593.68 (the value of the risk aversion coefficient, at which 
the CRS model fits the observed mean log excess return on the market portfolio), the mean log 
equity premium produced by the DI model is very low. As y decreases, the mean log excess 
market portfolio return generated by the model increases, but always remains below the observed 
value for any non-negative value of y, so that, in contrast with the PIPO model, there is no 
value of the risk aversion parameter, at which the DI model fits the observed mean log equity 
premium. This result is in the line with KP (2009).?+ 

For the DI model, the effect of uninsurable risk on the log risk-free rate depends on the sign 
of the sum of the fourth through six terms in the right-hand side of equation (95). At high 
values of the risk aversion parameter y, the log risk-free rate generated by the DI model is very 
low and increases as y decreases. With the moments reported in Table II and £ set to 1, the DI 
model fits the observed mean log risk-free rate at y = 1.02. This is due to the positive effect of 
the fourth and sixth terms in the right-hand side of equation (95), which overweight the negative 
effect of the fifth term at any y > 0.54. 

The mean log currency premium generated by the DI model is implausibly large at high 
values of the relative risk aversion coefficient, but decreases when y decreases. However, with 
the moments reported in Table I, there is no positive value of y, at which the DI model fits the 
observed mean log currency premium. 

As of the equation for the log of the growth rate in the real exchange rate, it may be seen from 


equation (96) that, because the term (Avary (cn,t+1) — Avarn(cj, ,,1)) in the right-hand side of 


20 Since y > 0, for any value of y the home country currency depreciates (appreciates) in response to the increase 
in the foreign (home) within-country variance in the DI environment. In the PIPO model, the implication is the 
same if y < 1 and is exactly reverse as long as y > 1. 

21 When using calibration, KP (2009) find that the PIPO model can explain the observed mean equity premium 
with a risk aversion parameter between 5 and 6, while there is no value of the relative risk aversion coefficient 
that enables to explain the mean excess return on the market portfolio in the DI environment. 
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this equation is negatively correlated with Aq41 and the coefficient of this term is negative 
for any y > 0, this new term may have significant additional explanatory power beyond the 
difference in the log growth rates of aggregate per capita consumption of respective countries in 
explaining volatility of the log of the growth rate in the real exchange rate. 

The results reported above suggest that there is no pricing kernel that is able to explain 
simultaneously all the considered asset pricing anomalies. The PIPO model has the potential 
to solve the equity and currency premia puzzles, while the DI model is likely to outperform the 
CRS model in explaining the risk-free rate and the log of the growth rate in the real exchange 
rate. It is especially unlikely that there is a value of the risk aversion parameter that is the 
same for all the equations and that allows to explain all the puzzles simultaneously. In this 
situations, it is more reasonable to look for the pricing kernel that enables to explain the asset 
pricing anomalies as better as possible (not necessarily exactly). 

Note that the approach we used above has some limitations due to the assumptions of 
joint conditional log normality and homoskedasticity we made to derive the restrictions on the 
equity and currency premia and the risk-free rate implied by each model. In this context, 
the GMM estimation technique seems to be more appropriate. The advantage of the GMM 
approach is that, when implementing this technique, it is not necessary to make any assumptions 
about the distributions of the variables in the Euler equations. In contrast to calibration, the 
GMM technique enables to estimate conditional (and not only unconditional, as in the case of 
calibration) Euler equations, as required by economic theory. The use of additional orthogonality 
conditions as well as taking into consideration time-series properties of the variables in the 
estimated equations and possible correlation of the error terms in the equations may dramatically 
affect the inferences about the relative performance of each model.?? 

In the next section, we assess the empirical performance of each of the stochastic discount 
factors considered in this paper using the GMM estimation approach. When using the GMM 
technique, we look for the values of the risk aversion coefficient and the subjective time discount 
factor that make the sample analogs of the orthogonality conditions implied by a stochastic 
discount factor as close as possible to zero. This helps us to choose a pricing kernel that is 
the best (among the considered pricing kernels) in the sense of minimizing (at the values of the 
parameters that are the same for all the equations) the difference of the sample analogs of the 
orthogonality conditions associated with the stochastic discount factor from their theoretical 


values implied by economic theory. 


22Tt is worth noting that calibration is a special case of the GMM estimation when the set of instruments has 
a constant only and is therefore equivalent to the unconditional GMM. 
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4.3 The GMM Estimation and Testing Results 


We consider three alternative environments, namely the CRS, DI, and PIPO environments. 
When assessing the performance of the candidate stochastic discount factors, we first estimate 
the parameters of each model using the GMM estimation technique and investigate whether the 
pricing kernel under consideration is able to make the moment conditions as close as possible 
to zero at economically plausible values of the agent’s preference parameters. Then, we check 
the predictive ability of each stochastic discount factor at the values of the parameters obtained 
from the GMM estimation. 

For each environment, we jointly estimate four equations. These equations are the real 
exchange rate equation and the Euler equations for the equity premium, the risk-free rate of 
return, and the currency premium. Hence, in contrast with KP (2007, 2009), who test the 
ability of the PIPO pricing kernel to explain the exchange rate puzzle (KP, 2007) and the equity 
premium puzzle (KP, 2009) in isolation, in each environment we address four puzzles (i.e., the 
consumption-real exchange rate, equity premium, risk-free rate, and currency premium puzzles) 
jointly in an integrated framework with the stochastic discount factor, which is the same in all 
the equations. 

Another advantage of our approach is that instead of using calibration, as in KP (2009), for 
example, we estimate the models using the GMM estimation technique. As we argued above, 
this allows us to estimate the parameters of not only unconditional Euler equations, as this is the 
case with calibration, but also conditional Euler equations, as required by economic theory. The 
use of additional restrictions, implied by the condition that the forecast errors associated with 
Euler equations are uncorrelated with any variables that are in the agent’s information set (a set 
of instruments), may significantly affect the estimation results and therefore inferences about 
the appropriateness of each of the considered models. Moreover, the use of the GMM estimation 
approach enables us to calculate the confidence intervals for the values of parameters, what is 
impossible when calibration is used. One more advantage of the use of the GMM is that this 
approach, in contrast with calibration, allows to take into consideration time-series properties of 
the variables in the equations as well as possible correlation of the error terms in the estimated 
equations. 

In estimation, we use five different sets of instruments. These sets are the same as in Epstein- 
Zin (1991). The first set has a constant and the US aggregate per capita consumption growth 
rate lagged one and two periods.?? The second set has a constant, the US aggregate per capita 
consumption growth rate lagged one period and the return on the market portfolio (proxied by 
the value-weighted return on all stocks listed on the NYSE, AMEX, and NASDAQ) lagged one 


3 As argued by Hall (1988), the second lag in instrumental variables helps in reducing the effect of time 
aggregation. Furthermore, Ogaki (1988) demonstrates that the use of the second lag is consistent with the 
information structure of a monetary economy with cash-in-advance constraints. 
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period. The third set has a constant, the US aggregate per capita consumption growth rate 
lagged one period and the return on the market portfolio (proxied by the equal-weighted return 
on all stocks listed on the NYSE, AMEX, and NASDAQ) lagged one period. The fourth and 
fifth sets are our respectively second and third sets of instrumental variables lagged an additional 
period. 

To check whether the estimation and testing results are robust to the chosen measure of 
consumption and the sample selection criteria, we estimate the DI and PIPO models for the 
period from 1982:Q1 to 1999:Q4 using both the KP (2007, 2009) and BSW data on household- 
level consumption expenditures. Estimating the model parameters for the sample period that 
extends from 1982:Q1 to 2004:Q4 as well as for the subsample corresponding to the sample 
period used by KP (2009) ending in 1999:Q4 enables to investigate whether the results are 
robust to the time period under consideration. 

Below, we report the GMM estimation and testing results for each of the three considered 


pricing kernels. 


The CRS Model. This is our benchmark model. Recall that the stochastic discount factor 
for this environment from the home country’s perspective is given by (84) and hence, under 
the assumption of complete risk sharing, we jointly estimate the Euler equation for the equity 


premium as 


Et 


C \ _ 
Zi (Ruts — Rrtyi)| = 09, (98) 
t 


the Euler equation for the risk-free rate as 


a -y 
a( a) RF t+ 


t 


E; =1, (99) 


the Euler equation for the currency premium as 


Cr \ (R — Si41)P 
E ( a) (Fi = Six) Pe} _ (100) 
Cr SePryi 
and the regression equation for the real exchange rate as 
Aq = 7 [Acti iE Acts1] + Cet (101) 


Table III reports the GMM estimation results. We find that the estimate of the relative risk 
aversion coefficient, that makes the moment conditions as close as possible to zero, is within 
a conventional range of values and statistically different from zero at any conventional level of 
significance. The point estimate of the subjective time discount factor is always greater than 1 
and, in most cases, statistically different from 1 at the 5% significance level. The model is not 


rejected statistically by Hansen’s test of overidentifying conditions at the 5% level of significance. 
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These results are robust to the used set of instruments, the proxy for the market portfolio, the 


time period, and the data set used in estimation. 


[ Table IH ] 


The PIPO Model. Under the assumption that agents can insure consumption using the do- 
mestic financial markets, but, due to private information about agents’ skill shocks and their 
work effort, financial intermediaries strike incentive compatible contract, which prevents com- 
plete risk sharing, the pricing kernel is given by (83) and therefore we jointly estimate the Euler 


equation for the excess return on the market portfolio over the risk-free rate as 


a y-7 
exp ( 5 Avarp, (enesa)) (Rutt — RrFt+1)| = 0, (102) 


the Euler equation for the risk-free rate as 


Cet k y= 7? = 
E |B a exp 5 Avarp (cat+1) | Rriti] =1, (103) 
t 
the Euler equation for the currency premium as 
j (2 Ge ) (Ft — Sti) Pt 
E — ex Avar, (c a = 0 104 
t ( CG, P 5) h ( hyt+1) SPa (104) 


and the regression equation for the log real exchange rate growth between the home and foreign 


countries as 
2 


Aqiri = 7 [Ac — Acts] + I= (Avarn (ch t41) — Avarn (cnt41)) + 41. (105) 


Table IV shows that the estimate of the coefficient of relative risk aversion for this model is 
slightly lower than the estimate for the CRS model and is also statistically different from zero 
at any conventional level of significance. The PIPO model is not rejected by Hansen’s test of 
overidentifying conditions at the 5% level of significance. These results are similar to the results 
we obtained for the CRS model. Another common feature of these two models is that in the 
PIPO model the subjective time discount factor is also estimated to be greater than 1, but, in 


contrast with the CRS model, is not statistically different from 1 in most cases. 


| Table IV ] 


The DI Model. Because in the DI environment the stochastic discount factor is given by 


(82), we jointly estimate the Euler equation for the equity premium as 


Et 


Cı 


Cui\ +7? 
( s) exp (2 = Avarp (nea) ) (Rutt — RrFt+1)| = 0, (106) 
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the Euler equation for the risk-free rate as 


G =y +72 
Et a( a) exp (2 n Avar (Cht+i) | Rre | =1, (107) 
t 
the Euler equation for the currency premium as 
Cay 2 F; — St41)P. 
E ( ta) exp e Avar, (enen) ) (Fi Si) t| =0 (108) 
Ct 2 StPt+1 


and the regression equation that relates the log real exchange rate growth to individual con- 


sumption as 


Y+ 


2 
Aq = 7 [Aci — Ac] + —— (Avarn (ha) — Avar, (cn,t+1)) + Nipa: (109) 


As follows from Table V, although the relative risk aversion coefficient is often estimated 
imprecisely, it is always in the conventional range. Likewise the CRS and PIPO models, the 
DI model is not rejected by Hansen’s test of overidentifying conditions at the 5% significance 
level. However, in contrast with the previous two models, the DI model yields the estimates of 
the subjective time discount factor that are (except a few cases) close to but lower than 1, as 


required by economic theory. 


| Table V | 


4.4 The Predictive Ability of the Models 


The goal of this section is to investigate whether the stochastic discount factors considered in 
this paper can fit the observed fluctuations of the real exchange rate, the equity premium, the 
risk-free rate of return, and the currency premium at the values of the parameters obtained in 
the GMM estimation above. 

We use the property that for any asset i 


E [SD Fiyi Ri t+1] = (110) 


By taking unconditional expectations of the both sides of equation (110) and lagging one 


period (to simplify notations), we obtain 
E[SDF,R;4] = 1. (111) 
For the excess return on the market portfolio over the risk-free rate, 
E (SDF; (Rust — Ret) = 0. (112) 
Because 


E [SDF; (Rms = Rrt)] = COU (SDF, Rut = Rre) + E [SDF] E [Rm = Rra] 5 (113) 
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we can write for the equity premium 


cov (SDF, Rutz — Ree) 


E (Rut Rez] = E (SDF) (114) 
Equivalently, for the currency premium, we obtain 
> ETERS see) E cov (SDF, | ies 
Sti Pt E [SDF] 
For the risk-free rate, 
E [Rr:] = 1 — cov (SDF;, RF+) (116) 


E|SDF;] 

To check the ability of a candidate pricing kernel to predict the mean equity premium, for 
example, we first calculate the average excess return on the proxy for the market portfolio?4 
over the risk-free rate for the first 30 quarters using equation (114), when the value of the 
pricing kernel for each quarter within the 30-quarter sample period is calculated at the GMM 
estimates of the parameters reported in the previous section.” Then, we move one quarter 
ahead and calculate the average equity premium generated by a model over quarters 2 to 31, 
etc. Repeating this rolling sample procedure, we obtain 62 estimates of the average (over 30 
quarters) excess returns on the market portfolio over the risk-free rate. To assess the quality of 
fitting, we use two different measures of goodness-of-fit. The first measure is the coefficient of 
correlation between the value of the mean excess return predicted by the model and the value 
observed in the data. The second measure used in the paper is the mean squared error (MSE) of 
prediction. We implement the same procedure to assess the ability of each candidate stochastic 
discount factor to fit the observed fluctuations of the risk-free rate (using equation (116)) and 
the currency premium (using equation (115)). 

For the log real exchange rate growth between two countries, we directly use the associated 
equation and estimate for each quarter the predicted log growth in the exchange rate as the 
value of the right-hand side part of the equation calculated at the GMM parameter estimates. 
Because we can do this for each quarter within the time period from 1982:Q1 to 2004:Q4, we 
generate 91 predicted values (one value per quarter). After that, as for the equity premium, the 
risk-free rate, and the currency premium, we calculate the coefficient of correlation between the 


predicted and observed values as well as the MSE of prediction. 


24 As proxies for the return on the market portfolio, we use the value- and equal-weighted returns on all stocks 
listed on the NYSE, AMEX, and NASDAQ. 

25 The choice of the length of a sample period is somewhat arbitrary. We choose it to be equal 30 quarters for 
the following reasons. The first reason is that this length allows us to quite accurately estimate the population 
mean leaving the opportunity to get a sufficiently long time-series of the estimates of the mean values. The second 
reason is that this length is comparable with the average duration of business cycles since November 1982 until 
December 2004. As documented by the US National Bureau of Economic Research (NBER), there were three 
business cycles with troughs in November 1982, March 1991, and November 2001 with the average duration of 
28.4 quarters (trough from previous trough). 
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Table VI reports the results. Panel A shows the testing results when the proxy for the market 
portfolio is the CRSP value-weighted index. In Panel B, we report the results for the CRSP 


equal-weighted index. 
| Table VI ] 


When comparing the relative performance of the considered stochastic discount factors, we 
can see that for the both proxies for the market portfolio the PIPO model is very close to 
the CRS model in its ability to predict the risk-free rate. The both models predict the mean 
risk-free rate of return, which is highly positively correlated with the observed mean risk-free 
rate and is characterized by a low value of the MSE. The CRS and PIPO models both fail 
to accurately predict the mean equity premium. For the both models, prediction is negatively 
correlated with the observed value when the CRSP value-weighted index is used as a proxy for 
the market portfolio. The PIPO model performs even worse than the CRS model when the 
proxy is the CRSP equal-weighted index. Although the MSE for the mean currency premium is 
similar for the both models, the CRS model outperforms the PIPO model in terms of correlation 
between the predicted and observed mean currency premia. For the CRS model, the coefficient 
of correlation is much greater and hence this model is able to better predict the direction of 
movement of the mean currency premium. The both models fail to predict the log of the growth 
rate in the real exchange rate. The values generated by the both models are negatively correlated 
with the values observed in the data. The performance of the PIPO model is even worse than 
that of the CRS model. The coefficient of correlation for the PIPO model is much lower than 
the correlation coefficient obtained for the CRS model. 

In contrast with the CRS and PIPO models, the predictive ability of the DI model is not 
sensitive to the used proxy for the market portfolio return. The DI model works much better 
than the other two models in predicting the mean excess return on the market portfolio over the 
risk free-rate. The generated by the DI model and observed mean values of the equity premium 
are always positively correlated, while they are positively or negatively correlated, depending on 
the used proxy for the market return, for the CRS and PIPO models. Although the MSE for 
the mean currency premium for the three models is practically the same, the DI model yields 
the predicted value that is highly positively correlated with the observed value. The DI model 
also significantly outperforms the CRS and PIPO models in predicting the log of the growth 
rate in the real exchange rate. This is the only (among the considered) model that generates 
the log of the growth rate, which is positively correlated with the value observed in the data. 
This result is not surprising given the intuition presented in Section 4.2. Perhaps the only point, 
at which the DI model is slightly outperformed by the CRS and PIPO models is its ability to 
predict the mean risk-free rate. As for the other two models, the predicted mean risk-free rate 


is highly positively correlated with the observed mean value, but the MSE for the DI model is 
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higher than for the other two models. 

Figures 1-3 illustrate graphically the performance of each of the three models when the 
CRSP value-weighted index is used as the proxy for the market portfolio and the estimates of 
the parameters are those obtained for the instrumental variable set INST1.2° The black line in 
each figure presents the observed values and the grey line presents the values generated by a 
model. In the figures for the mean equity premium and the mean currency premium, the left 


vertical axis is for the observed value and the right vertical axis is for the predicted value. 
| Figures 1-3 | 


As can be seen from Figures 1-3, although the DI model generates the mean currency pre- 
mium, which is more correlated (compared with the CRS model and, especially, the PIPO model) 
with the observed mean currency premium, it is still unable to replicate the highly volatile ex- 
cess return on currency at an economically realistic estimate of the risk aversion parameter and 
hence is still unable to solve the currency premium puzzle. The evidence is also that the DI 
model outperforms the CRS and PIPO models in predicting the log real exchange rate growth. 
The predicted and observed values of the log of the growth rate in the real exchange rate are 
positively correlated (instead of being negatively correlated, as in the case of the CRS and PIPO 
models), but the observed real exchange rate is more volatile than predicted by the DI model, 
so that the consumption-real exchange rate anomaly still remains a puzzle. Despite the fact 
that, in contrast with the CRS and PIPO models, the DI model yields plausible estimates of the 
agent’s preference parameters when the four anomalies are considered jointly in an integrated 
framework, the mean equity premium predicted by this model is very close to zero. This is the 
sign of the equity premium puzzle. It must however be noted that the predictive ability of the 
three candidate pricing kernels is quite good when these stochastic discount factors are used to 


predict the risk-free rate. 


4.5 Measurement and Sampling Errors in the Consumption Data 


In estimation, we use the home and foreign country first-differences of the cross-sectional vari- 
ances of log consumption calculated from quarterly data on individual consumption expendi- 
tures. In this section, we investigate how the first-differences of the cross-sectional variances and 
therefore the GMM estimation and testing results might be affected if individual consumption 
is measured with error. As in KP (2007, 2009), we focus our attention on the implications of 


the measurement and sampling errors in individual consumption. 


6 Because, as it is shown in Table VI, the properties of the values generated by each of the considered models 
only slightly depend on the used set of instruments, we report the figures for only one set, namely INST1. The 
results for the other sets of instrumental variables are similar. 
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Assume first, as in Vissing-Jorgensen (2002), that an observation error in the consumption 


level, €px,2, is multiplicative and unbiased, i.e., 
_ ArT 
Chkt = Chk,tEhk,t> (117) 


where Chk and Chet are respectively the observed and true consumption of household h in 
country k in period t, and the observation error €),, t is independently and identically distributed 
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across households with mean 1 and variance varp (Ehk t), Ehkt ~ TID (1, varn (Enx,t)); 
independent of the true consumption level Chet: 

Since the exact form of the observation error is typically unknown, following Vissing-Jgrgensen 
(2002), assume that €x t is independently and identically log-normally distributed (in the cross 
section), In (Enk t) ~ TIDN (upt varn (In (Eng,t))) - 


Because the observation error is independent of the true consumption, 


VATh (Chk t) = VAT (chea) + varp (In (Enk,t)) (118) 
and hence 
Avarp (Chit+1) = Avarn (Chk t41) + Avar (In (Engt+1)) - (119) 


The assumption that the measurement error €k, is homoskedastic over time implies that 


In (Enk,t) is stationary over time with mean py, = In(2/\/1+ varp (Enk t) /4) and variance 
varp (In (Enk t)) = In (1+ varp (Enk t) /4). From this, it follows that Avar, (In (Enk t+1)) = 0 
and therefore 


Avarp, (Chk t41) = Avarp (chrt) (120) 


for any t. 

This suggests that the observation error of the form assumed here does not affect the first- 
differences of the sample cross-sectional variances of log consumption and therefore does not 
affect the estimation and testing results. 

The situation may be different if ¢,,4 is not homoskedastic over time and thus In (E€hk,t) is 
not time stationary. Assume that varp (In (Enk t)) is a (0 < a < 1) times varp (Chk). In this 
case, 


Avarn (chrii) = (1 — a) Avar, (Chk +1) É Avarn (Chkt41) (121) 


meaning that, in the presence of measurement error, the estimates of the relative risk aversion 
coefficient and the subjective time discount factor will both be biased if the observed (and not 
true) consumption levels are used to calculate the first-differences of the cross-sectional variances 
of log consumption. 

The estimates of the cross-sectional variance of log consumption are generally subject to 


sample-to-sample variation (sampling error). This raises the question of the admissibility of using 


?T See also Brav et al. (2002). 
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the sample variance instead of the population variance when estimating the model parameters. 
Note that in our data set in each quarter the number of observations on household consumption 
expenditures is between 700 and 1400. The law of large numbers implies that, when the number 
of households in the cross section in each period goes to infinity, the sample variance converges 
to the population variance (the sample variance is a consistent estimator of the population 
variance). It follows that, with such a large number of observations in the cross section, we may 
expect the sample cross-sectional variance to stay close to the population variance and hence 
may expect sampling error not to affect the estimation and testing results.?° 

Because, in contrast with sampling error, the observation error in individual consumption 
biases the estimates of the parameters of interest when the observation error of the form assumed 
here is heteroskedastic over time, we assess the magnitude and direction of this bias by setting 
a equal to 0.3 and reestimating the DI and PIPO models with Avara (chet) = 0.7Avarp (Chk) 
for all t using the BSW data set for the period from 1982:Q1 to 2004:Q4. The estimation results 
are reported in Table VII. We find that, when observation error is taken into consideration, 
the estimation results for the both models become closer to the results obtained for the CRS 
model. This finding is not surprising given that, as we argued above, when the first-differences 
of the cross-sectional variances of log consumption converge to zero, both the DI and PIPO 
models converge to the CRS model. The three models coincide when the first-differences of the 
cross-sectional variances of log consumption equal zero for all t. This means that if observation 
error is large and properly taken into consideration, then the estimates of the parameters of the 
DI and PIPO models might be very close to (perhaps even indistinguishable from) the estimates 
obtained for the CRS model. 


[ Table VII | 


In this paper, we do not directly investigate whether trimming and winsorizing affect the 
estimation results for the PIPO and DI models, as it is done in Kollmann (2009). However, 
the intuition is that, since trimming and winsorizing both reduce the volatility of the cross- 
sectional distribution of log consumption, we may expect them to also reduce the changes in the 
cross-sectional variances of log consumption in both the home and foreign countries. This case 
is very close to the case we considered above when estimated the PIPO and DI models with 
Avarn (Chet) = 0.7Avarp (Chk t) and hence we may expect that trimming and winsorizing would 
make the estimation and testing results for the PIPO and DI models closer to the results for 
the CRS model. 


28 As mentioned in KP (2007), sampling error might be a serious problem when estimating the parameters of 
the pricing kernel calculated as the ratio of the yth uncentered moments of the cross-sectional distribution of 
consumption in periods t and t+ 1. 
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5 Conclusion 


This paper addressed a few extant domestic and international financial markets anomalies, 
namely the equity premium, risk-free rate, consumption-real exchange rate, and currency pre- 
mium puzzles. We investigated the potential of two stochastic discount factors, which allow 
incomplete risk sharing in economies with consumer heterogeneity, to resolve these anomalies 
in an integrated framework. The first stochastic discount factors is the DI pricing kernel. This 
stochastic discount factor describes the market structure with domestically incomplete financial 
markets, where idiosyncratic privately observed shocks are uninsured, while sequential trade in 
assets enables agents to partially hedge publicly observed shocks. The second stochastic discount 
factor is the PIPO pricing kernel that describes the market environment in which both private 
and public shocks are insured subject to truth revelation constraint by agents. Assuming cross- 
sectional log normality of individual consumption levels, we derived the closed-form formulas for 
these two pricing kernels. The obtained stochastic discount factors are new to the literature. 

We tested empirically both these stochastic discount factors using household-level data on 
consumption expenditures from the US and the UK. In contrast with KP (2007, 2009), when 
assessing the empirical performance of each candidate stochastic discount factor, we addressed 
the above-mentioned four puzzles in an integrated framework with the same pricing kernel in all 
the equations. Using the GMM estimation technique, we found that, when the Euler equations 
for the equity premium, the risk-free rate of return, and the currency premium, as well as the 
regression equation for the log growth in the exchange rate are estimated jointly, the asset pricing 
implications of the PIPO model are very close to those of the CRS model. In contrast with the 
CRS and PIPO models, the DI model makes the sample analogs of the orthogonality conditions 
as close as possible to zero at economically plausible values of both the relative risk aversion 
coefficient and the subjective time discount factor. When testing the predictive ability of these 
pricing kernels, we however found that although the DI model outperforms the CRS and PIPO 
models, it is unable to solve the above-mentioned puzzles when they are considered jointly in 
an integrated framework. This might be due to a low time-series volatility of the change in 
the cross-sectional variance of the log of individual consumption by means of which the agents’ 
heterogeneity is taken into account in both the PIPO and DI pricing kernels. 

This result suggests that although heterogeneity in individual consumption plays an impor- 
tant role in explaining asset returns, it is unable to resolve the asset pricing anomalies in an 


integrated framework when taken into account alone. 
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Table I. 
Descriptive Statistics. 


Variable Min Median Max Mean SD Skew Kurt JB 


A. 1982:Q1 - 1999:Q4 


Cr4i/C; 0.9926 1.0056 1.0126 1.0055 0.0038 -0.6077 3.9229 6.6960 

ee Oe 0.9893 1.0080 1.0291 1.0084 0.0088 0.1374 2.7390 0.4130 
Rywt+1 0.7622 1.0410 1.2066 1.0373 0.0793 -0.4887 4.3604 8.0667 
Rew,t+1 0.7023 1.0253 1.3160 1.0330 0.1090 -0.0842 3.6424 1.2679 
RFt+1 0.9998 1.0066 1.0166 1.0066 0.0037 0.4976 2.9751 2.8496 
Agi -0.1608 0.0020 0.1487 0.0006 0.0565 -0.1246 3.1078 0.2120 
Egun -0.1618 -0.0052 0.1321 -0.0065 0.0558 -0.0939 2.8951 0.1331 
Ri t+ 0.7308 1.0463 1.2425 1.0387 0.0933 -0.4652 4.3440 7.6815 
R2 t+1 0.7906 1.0391 1.1809 1.0322 0.0617 -0.9079 5.5732 28.5149 
R341 0.7649 1.0412 1.3399 1.0454 0.0981 0.3794 4.5972 8.9894 
Ratqi 0.7533 1.0433 1.2421 1.0404 0.0983 -0.3088 2.9683 1.0993 
R5,t41 0.7600 1.0409 1.2070 1.0377 0.0935 -0.8229 4.0735 11.1003 


The Kocherlakota-Pistaferri Data Set 


Avar;,(Cnt+1) -0.0832 0.0021 0.0528 0.0004 0.0248 -0.4784 3.5101 3.3797 
Avarn (Ch t41) -0.1373 0.0039 0.1086 0.0017 0.0392 -0.4294 4.7849 11.2804 


The BSW Data Set 


Avarn(Ch +1) -0.0505 -0.0015 0.0636 -0.0001 0.0227 0.2079 2.8841 0.5355 
Avarn (C441) -0.1482 -0.0008 0.1298 0.0020 0.0443 -0.4190 4.8169 11.5099 


B. 1982:Q1 - 2004:Q4 


Cr41/C; 0.9926 1.0056 1.0126 1.0053 0.0036 -0.4920 3.7453 5.6509 

Pale; 0.9893 1.0073 1.0291 1.0082 0.0084 0.1521 2.8793 0.3971 
Rywt+1 0.7622 1.0327 1.2066 1.0278 0.0856 -0.4832 3.5251 4.4864 
Rew,t+1 0.7023 1.0262 1.3160 1.0342 0.1157 -0.0777 3.1646 0.1899 
RFt+1 0.9917 1.0052 1.0166 1.0050 0.0049 -0.4022 3.3181 2.7753 
Aqiz1 -0.1608 0.0016 0.1487 0.0016 0.0529 -0.1290 3.2866 0.5514 
ES -0.1618 -0.0057 0.1321 -0.0081 0.0523 -0.0643 3.0450 0.0688 
Ri t+ 0.7308 1.0397 1.2425 1.0319 0.0910 -0.3438 4.0195 5.6071 
R2 t+1 0.7906 1.0346 1.1809 1.0287 0.0672 -0.9112 4.7373 23.5081 
R3 441 0.6786 1.0339 1.3399 1.0282 0.1220 -0.3723 3.9002 5.0612 
R4 t+1 0.7533 1.0301 1.2421 1.0321 0.0972 -0.2595 3.0661 1.0151 
Rst41 0.7600 1.0372 1.2070 1.0323 0.0939 -0.6661 3.5549 7.7244 


The BSW Data Set 


Avarn(Ch +1) -0.0505 -0.0024 0.0636 -0.0006 0.0222 0.3099 2.7978 1.5764 
Avarn (Ch t41) -0.1482 -0.0013 0.1298 0.0022 0.0436 -0.2116 4.5216 9.2506 


Note: JB is the Jarque-Bera statistic. 
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Table II. 
Moments of the Variable Distributions. The BSW Data Set, 1982:Q1-2004:Q4. 


Covariance with 


Variable Mean Variance AAC+ rvwt+1  Avarn(crt+41) 
Acty1 0.005218 0.000013 0.000013 0.000038 0.000001 
TVW,t+1 0.024120 0.007295 0.000038 0.007295 0.000050 
TFt+1 0.005160 0.000026 0.000003 0.000037 0.000002 
Avarn(Ch +1) -0.000606 0.000493 0.000001 0.000050 0.000515 
S1 t+1 -0.012942 0.000045 0.000005 0.000011 -0.000003 
$2t41 -0.006945 0.002645 -0.000019 -0.000653 0.000077 


Correlation with 
Variable Mean Variance Acı — Aci, , Avarn(Ch +1) — Avara(G t41) 


Agi 0.001623 0.002795 -0.022877 -0.175575 
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